Is Document Frequency Important for PRF?
نویسندگان
چکیده
We introduce in this paper a new heuristic constraint for PRF models, referred to as the Document Frequency (DF) constraint, which is validated through a series of experiments with an oracle. We then analyze, from a theoretical point of view, state-of-the-art PRF models according to their relation with this constraint. This analysis reveals that the standard mixture model for PRF in the language modeling family does not satisfy the DF constraint on the contrary to several recently proposed models. Lastly, we perform tests with a simple family of tf-idf functions based on a parameter controlling the satisfaction of the constraint. This last series of experiments further validate the DF constraint.
منابع مشابه
A Study of Document Weight Smoothness in Pseudo Relevance Feedback
In pseudo relevance feedback (PRF), the document weight which indicates how important a document is for the PRF model, plays a key role. In this paper, we investigate the smoothness issue of the document weights in PRF. The term smoothness means that the document weights decrease smoothly (i.e. gradually) along the document ranking list, and the weights are smooth (i.e. similar) within topicall...
متن کاملA comparative study on pile group and piled raft foundations (PRF) behavior under seismic loading
Study on the seismic behavior of piled rafts and pile groups while the same amount of construction material and excavation is used in their construction, are the main objective of this research. The process where the raft interaction with soil can affect the seismic response and stress distribution is also discussed in the current study. By means, ABAQUS software was applied for the finite elem...
متن کاملLearning-Based Pseudo-Relevance Feedback for Patent Retrieval
Pseudo-relevance feedback (PRF) is an effective approach in Information Retrieval but unfortunately many experiments have shown that PRF is ineffective in patent retrieval. This is because the quality of initial results in the patent retrieval is poor and therefore estimating a relevance model via PRF often hurts the retrieval performance due to off-topic terms. We propose a learning to rank fr...
متن کاملHigh frequency of +1 programmed ribosomal frameshifting in Euplotes octocarinatus.
Programmed -1 ribosomal frameshifting (-1 PRF) has been identified as a mechanism to regulate the expression of many viral genes and some cellular genes. The slippery site of -1 PRF has been well characterized, whereas the +1 PRF signal and the mechanism involved in +1 PRF remain poorly understood. Previous study confirmed that +1 PRF is required for the synthesis of protein products in several...
متن کاملA Variable Prf Imaging Method for High Squint Diving Sar
The high squint diving SAR is widely used to provide the information in advance. Large squint angle deduces the deeper coupling of range and azimuth of SAR echoes which makes SAR imaging more difficult. Especially, the large range migration of the deep couple heavily burdens the imaging processing time and storage units. The diving motion of platform worsens the situation. This paper proposes t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011